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Let X\ , . . . , X n be i.i.d. random observations. Let § = L + T be a 
[/-statistic of order k > 2 where L is a linear statistic having asymp- 
totic normal distribution, and T is a stochastically smaller statistic. 
We show that the rate of convergence to normality for S can be sim- 
ply expressed as the rate of convergence to normality for the linear 
part L plus a correction term, (varT) ln 2 (varT), under the condition 
ET 2 < oo. An optimal bound without this log factor is obtained under 
a lower moment assumption E|T| a < oo for a < 2. Some other related 
results are also obtained in the paper. Our results extend, refine and 
yield a number of related-known results in the literature. 



1. Introduction. There has been a vast literature related to normal ap- 
proximations and to the rates of convergence to normality for [/-statistics 
of order k>2. Undoubtedly, the case for k = 2 has been most studied and 
is best understood, so we will start our discussion with this case as well. Let 
X,X\,X2, ■ ■ ■ ,X n , n > 2, be independent and identically distributed (i.i.d.) 
random variables (r.v.'s). Define a [/-statistic of order 2 by 

v ' l<t<j<n 

where the kernel h(x,y) is a real- valued Borel measurable function, symmet- 
ric in its arguments with Eh(X\,X2) = 0. By the Hoeffding decomposition, 
we have 

2a g = ^- g T.^)+ E ^Xj) 

y v y l= i v v ) y l<j<7<n 
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(1 ' 2) 

= L + T, 

where L and T are the linear and quadratic terms, respectively, and 

g(x) = E[h{x,X 2 )\ - 6, a] = Var[ 9 (X 1 )], 

r](x, y) = h(x, y) - g(x) - g(y) + 9. 

Throughout the paper, it is assumed that a 2 G (0,co). 

For any generic r.v. Y, denote its distribution function (d.f.) by Fy(x) 
F(Y<x). Then 



It is well known that F^ + f(x) converges to the standard normal d.f., denoted 
by $0), provided Eh 2 {Xi,X 2 ) < oo (see [11]). In fact, this moment condi- 
tion can be reduced to Eg 2 (X\) < oo and E\h(X l ,X 2 )\ 4/3 < oo (see Remark 
4.2.4 of [15], page 131). There also has been much work on the convergence 
rates to normality for [/-statistics of order 2. However, the sharpest Berry- 
Esseen bound of order O^" 1 / 2 ) comes from [14] and [9], who establish the 
following ideal bound: 

/ \ „ 1 fE\gi\ 3 E\m 2 \ 5/3 

(1.3) | -*| <-= + 



Vn\ <J 3 a b J 3 
where the following notation has been used: 

• gi = g(Xi) and rjij = r)(Xi,Xj). 

• For any function / :R— >-R, define ||/|| = sup^gjj . 

• By a <^.b for a, b > we mean a < cb for some positive constant c not 
depending on the underlying distribution function. 

Indeed, Bentkus, Gotze and Zitikis [4] showed that the moment conditions 
in (1.3), -E|(7i| 3 < oo and E\rji 2 \ 5 / 3 < oo, are the weakest possible in the 
Berry-Esseen bounds of order 0{n~ l l 2 ) for [/-statistics of order 2. 

Our main purpose of this paper is to extend the optimal results on Berry- 
Esseen bounds for U -statistics of order 2 to those of higher orders. In stark 
contrast with studies on [/-statistics of order 2, there is a very limited lit- 
erature on optimal or near-optimal error bounds for [/-statistics of higher 
orders. In this paper, we intend to fill in the gap. The work is of both 
theoretical and practical value since many symmetric statistics may be ap- 
proximated arbitrarily closely by [/-statistics of sufficiently high order under 
appropriate conditions. As an application, we will derive a near-optimal er- 
ror bound for Studentized [/-statistics of order 2 in Section 3.3. 
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At first sight, it appears to be an easy task to extend the optimal Berry- 
Esseen bounds from second-order [/-statistics to higher order ones. (At least 
we naively thought so in the beginning.) However, a close inspection soon 
leads one to believe that this is far from trivial. We note that the usual 
approach of dealing with U -statistics is to first use the Hoeffding decompo- 
sition to turn the statistic of interest into a sum of sequentially smaller and 
uncorrelated terms, and then to use truncation techniques to each of these 
terms. Broadly speaking, there are two main difficulties with this approach 
which we must overcome. Specifically, we have the following goals in mind. 

(A) A direct and transparent methodology is needed. 

The truncation techniques are manageable for second-order [/-statistics. 
However, as the order gets larger, it becomes more and more intangible 
and unworkable. In order to treat U -statistics of higher order than 2, in 
this paper, we may have to abandon the idea of truncation, and instead 
choose a more direct and more transparent approach. 

(B) A simple and unified form of error bounds is needed. 

For second-order [/-statistics, the Hoeffding decomposition produces a 
linear statistic L plus a degenerate [/-statistic T in (1.2). As a result, 
the two terms involving i?|<?i| 3 and £'|r/i2| 5//3 in (1.3) are from the lin- 
ear term L and the degenerate [/-statistic term T, respectively. More 
generally, applying the Hoeffding decomposition to U -statistics of order 
k > 2 leads to a sum of k terms. Consequently, the resulting optimal 
error bound would contain k terms which would become more compli- 
cated as k gets larger. In this paper, we strive to provide simple and 
general error bounds for U -statistics of general orders. 

Let us now look at (B) in more detail. Again, we will examine the case of 
[/-statistics of order 2 first. A closer look at the proof of (1.3), as in [14] and 
[9], shows that a more refined Berry-Esseen bound can be given as follows: 



Using some truncation arguments, it can be shown that (1.4) is equivalent 
to 



Note that the last term in (1.5) shows the interaction effect between L 2 and 
T in (1.2). This suggests that if we take the correlation between L and T 
into account, we might be able to improve the error bound. Indeed, if we let 



(1.4) 




(1.5) 




N 2 (x) = $(x) + -=k 2 &"(x) where k 2 = Eg^m/tf 
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be the adjusted normal approximation, Alberink and Bentkus [2, 3] show 
that the following optimal error bound holds: 

(1-6) -Hl+t - N 2 \\ < —;= 3 1 o — 

(see Theorem 1 of [2, 3]). We reiterate that the error bound given in (1.5) is 
optimal, and implies the optimal bound (1.3) as a special case by a simple 
truncation argument. 

Finally, we can rewrite (1.5) in a very simple form 

(1.7) \\F h+ f-N 2 \\ < pi-*||+varT. 

In other words, the error bounds in the adjusted normal approximations for 
[/-statistics of order 2 are simply the error bounds for the dominant linear 
part L plus the variance of the remaining "error" term T. 

It is somewhat surprising that the optimal error bound for [/-statistics of 
order 2 can be written in such a simple and compact form given by (1.7). 
The appearance of the variance term varT in (1.7) is appealing since the 
variance is the most natural and easy-to-interpret measure to describe the 
effects of the error term T. Furthermore, from a computational point of view, 
it is easier to calculate the variance of T than some other moments of T. 

The purpose of this paper is to derive optimal or near-optimal error 
bounds in the normal approximations for [/-statistics of order k>2, similar 
to those given in (1.7). We emphasize that the truncation techniques are 
heavily used in [2, 3] for second-order [/-statistics, and will not work for 
U -statistics of higher orders. 

The paper is arranged as follows. In Section 2, some definitions and nota- 
tion will be given. In Section 3, we establish optimal or near-optimal error 
bounds for [/-statistics of order k > 2. In Section 4, we give some approxima- 
tions to expectations of smooth functions of U -statistics and some of their 
applications. Possible extensions to other statistics are discussed in Section 
5. We provide the details of the proofs in Section 6. 

2. Definitions and notation. Let X, X\, . . . ,X n be a sequence of i.i.d. 
r.v.'s. Suppose that a symmetric statistic S = S(Xl, . . . ,X n ) can be decom- 
posed into 

(2.1) S = L + T, 

where L and T are symmetric [/-statistics, L of order 1 and T of order k, 
respectively. This means that L is of the form 

n 

(2.2) IL = ]TL 4 , 

i=l 
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and T is of the form, by way of Hoeffding decomposition, 

n n n 

T = E^+ E TV + ..-+ J2 r * 

i=l Ki<i<n Kii<'<ifc<n 

(2.3) 



dcf , 



Ti + ■•■ + ¥*, 

where Lj = l(X{) and T^..^. = ^(X^, . . . ,Xi k ) are Borel measurable func- 
tions, and ifc is invariant under permutation of variables. Without loss of 
generality, it is assumed that 

(2.4) EL = ET = and varL=l, 

which can always be achieved after re-centering and re-scaling the relevant 
statistics. Typically, L is the dominant term and T can then be called the 
error term. 

Now let us define, for a £ [1,2], 

k 



f3 = nE\L 1 \ i , 7 = varT, 



7 



(a) 



E 

P =i 



E\Tx... t 



Note that 



(2.5) 



E|L| 3 <nE|Li| 3 = /3, 



EITI 



E ET ?=E (1 Wi 



i=l 



i=l 



: 7, 



E|T| Q < fc a_1 J^E|T i | a «2 ( H ) E l Tl 



i=l i=l 

where the second inequality in (2.5) follows by first applying an inequality 
(2.18) from [10], 

a/2 



E\Ti\ a <C(a,i)M 



E 



T 



n—3i 



E 



3i-3i 



l<ji< — <ji<n 



l<ji<—<ji<n 

with C(a,i) a constant depending only on a and i, and then applying the 
following simple inequality (noting a/2 < 1): 

a/2 

< E \Tn-n\ a - 

l<jl<-<ji<n 

Therefore, j3, 7 and 7^ are closely related to -ElLl 3 , ET 2 and E|T| a , re- 
spectively. Furthermore, 7 = 7^). 

Finally, the following convention will be adopted throughout the paper: 
conditions appearing in a statement are implied implicitly in that statement. 
For instance, in (3.4), the conditions /3 < 00 and 7 < 00 are assumed even 
though (3.4) still holds true without these moment conditions. 
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3. Error bounds for [/-statistics. Asymptotic normality of the linear 
term L and its rates of convergence are well known. For instance, the latter 
is described by the Berry-Esseen bound: 

(3.1) ||F L -$||«/3. 

If T — > in probability, we have ||-F§ — $|| — > as n — > oo. Following the 
discussion in the Introduction, we are interested in the error bound for ||F§ — 
$|| in the form of 

(3.2) ||F S - iV p || « ||F L - $|| + /(E|T| a ) « + /( 7 W) 

for some function /, N p for some p > (to be defined below), and a < 2. In 
particular, when a = 2, (3.2) becomes 

\\F$ - N p \\ « \\F L - $|| + /(varT) < + /( 7 ). 

In other words, we wish to investigate the question: if a linear statistic L is 
perturbed by some error term T, how do we correct the usual Berry-Berry 
bound for normal approximations, using only the variance of T? 

The adjusted normal approximation N p [which was mentioned in (3.2)] 
can be defined as follows. Define Nq(x) = $(x) and 

(3.3) N p (x) = $(x) + G 1 (x) + --- + G p (x), p=l,2,..., 
where, for p = 1, . . . , k, 

k p = cov(L p , T) = EL P T = ( ELi • ■ • L p T, 

Gp(x) = (-ir +1 Kp^ +1 \x). 
Two questions immediately arise from the above definition: 

1. Under what conditions can we guarantee the existence of k p for p> 1? 
By the Holder inequality, we have 

E|Li •••/,;¥! < (E|Li| 3 ---E|Li| 3 ) 1/3 (E|T| 3/2 ) 2/3 . 

Therefore, k p exists if E|T| 3 / 2 < oo as we assume that E|Lj| 3 < oo for 
1 < i < n. If E|T| 3 / 2 = oo, we will see later that we only use Nq(x) = $(x). 

2. How does one interpret N p ? 

The adjusted normal approximation N p plays a central role in this paper. 
For [/-statistics of order 2 which first appeared in [2] , we have K\ = and 
k 2 = -n- 1 / 2 Eg 1 g 2 r] 12 /o-g, hence 

A/ 2 (x) = $(x)-^^if^$'"(x). 
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Note that N2 is different from its corresponding Edgeworth expansion, 

Fl+tW ^ W --L(^ + ^)^ w+o( „-^), 

which holds under the following optimal conditions: (i) the distribution 
of g(Xi) is nonlattice; (ii) E^il 3 < oo and E|t7i 2 | 5 / 3 < oo (see [12]). On 
the other hand, iVp's and Edgeworth expansions are similar in the sense 
that both try to improve the normal approximations by including some 
higher order correction terms. We point out that Edgeworth expansions 
do require some smoothness conditions while Berry-Esseen bounds do 
not. 

Of course, the adjusted normal approximations N p are not introduced 
to compete with Edgeworth expansions. Rather, they are convenient tool 
for obtaining a neat and unified error bound in the normal approximation 
for U -statistics of general order. 

In the following, we will give error bounds under two different sets of 
conditions: ET 2 < oo, and E|T| a < oo for a £ [1,2), respectively. Since the 
Hoeffding decomposition of T requires E|T| < oo, we will not consider the 
case E|T| a < oo for a E (0,1). 

3.1. Error bounds under ET 2 < oo. Here is our key result. 

Theorem 1. We have 
(3.4) ||F§-Ay «/? + 7 ln 2 ( 7 ), 

where the last term 7 In 2 (7) is taken to be zero when 7 = 0. 

For second-order [/-statistics, Alberink and Bentkus [2, 3] show 

Pfc-iVfcll «/3 + 7, 

which is sharper than that in Theorem 1 when k = 2. We conjecture that 
the logarithmic factor in Theorem 1 can be removed for [/-statistics of order 
k > 2. Actually, we have derived an error bound without log factors which, 
for the [/-statistics of order 2, implies the result of [2, 3]. Unfortunately, for 
k > 3, the error bound involves certain conditional variances, and the proof 
of this bound is indeed very complicated, compared to the proof of Theorem 
1. We do not provide this bound here since, in applications, this bound 
has only minor advantages, compared to the simple and indeed convenient 
bound of Theorem 1. 

Applying Theorem 1, we can derive the following optimal error bound 
for ||.F§ — $|| for [/-statistics of order k. (A similar theorem to (3.5) is also 
derived in [7], using Stein's method.) 
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Theorem 2. We have 



(3.5) 



F s -$|| </3 + 7 



1/2 



Furthermore, the exponent 1/2 in (3.5) is the best possible for U -statistics 
of order k>2. 

Comparing Theorems 1 and 2, we see that the adjusted normal approxi- 
mation Nk(x), which takes into account of the correlations between L fc and 
T, does show improvement over the standard normal distribution 

3.2. Error bounds under E|T| a < oo for a £ [1,2). Theorem 1 contains 
a log factor which will be inherited when applying the result. We will pro- 
vide an optimal error bound without the log factor under a lower moment 
assumption E|T| a < oo for a 6 [1, 2). 

To do this, we define Ao = 3? and 



where p = p(a,k) is the largest integer such that p < (a — l)/(2 — a) and 
p < k. For example, if a < 3/2, then N p = If 3/2 < a < 5/3 and k > 1, 
then N p = $ + Gi. If 5/3 < a < 7/4 and k > 2, then N p = $> + G 1 + G 2 . 

Theorem 3. ForaG [1,2), let p = p(a,k) be defined above, we have 



Theorem 3 would be of most interest when a is close to 2 in applications. 
As an example, we will apply Theorem 3 below to obtain a near-optimal 
Berry-Esseen bound for Studentized ^/-statistics of order 2. 

3.3. An application to Studentized U -statistics of order 2. The optimal 
error bound for standardized ^-statistics of order 2 was given in (1.3). We 
conjecture that the same optimal error bound applies to their corresponding 
Studentized ^/-statistics defined by 



N p = $ + Gi + • • • + G, 



F s -N p \\<.f3 + 7 



where a 2 g 




2 



However, we have so far only managed to prove a near-optimal result, as 
given in Theorem 4 below. Although this result is the best available bound 
in the literature, it falls a little short of the optimal bound (i.e., e = 0). 
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Theorem 4. For any e > 0, we have 

The proof of Theorem 4 is very involved, and hence will not be given 
here. Its proof can be obtained from the authors. The proof basically involves 
approximating § by a [/-statistic of sufficiently high order and then applying 
nontrivial truncation techniques. Currently we are trying to eliminate e in 
Theorem 4, and hope to be able to report on this in the near future. 

4. Error bounds for expectations of smooth functions. Here, we give 
some approximations to expectations of smooth functions of [/-statistics. 
These results (e.g., Theorems 5 and 6) are of interest in their own right. 
There are also other useful applications, for example, in approximating char- 
acteristic functions. For instance, Corollaries 1 and 2 have been used in the 
proofs of Theorems 1 and 3, respectively [see (6.6) and (6.22)]. 

Throughout this section, let / : R — > R be a sufficiently smooth function. 



Theorem 5. We have 



f(u)dF{S<u}- j f(u)dN k {u) 

JR 



«C(/3 + 7 ), 



where C = \\f"\\ + ■ ■ • + ||/( fc + 4 ) || . 



Choosing f(u) = exp{itu} in Theorem 5, we obtain a useful inequality for 
the characteristic functions. 



Corollary 1. We have 

EexpjitS}— / exp{itu} dNk(u) 



«(|t| + |t| fc+4 )(/3 + 7)- 



Theorem 5 and Corollary 1 can be extended to the case of the lower 
moment assumption E|T| Q < oo with a < 2. However, the technical details 
are more involved in Theorem 6 than those in Theorem 5. 



Theorem 6. For a G [1,2), we have 

f(u)dF{S<u} - f f{u)dN p (u) 



« C{(5 + 7 



where p = p(a,k) is defined in Theorem 3, and C = \\f'\\ + \\f"'\\ for a = 1 
and C=||/'|| + --- + ||/( fc+4 )|| for a G (1,2). 
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Choosing f(u) = exp{itu} in Theorem 6, we get: 
Corollary 2. For a £ [1,2), we have 



EexpjitS}— / exp{itu} dN p (u) 



«c(t)(/3 + 7 ( a ); 



where p = p{a, k) is defined in Theorem 3, and C(t) = \ t\ + \t\ 3 for a = l and 
C(t) = \t\ + \t\ k+i for a £ (1,2). 



5. Extensions. One may consider extending the work on the error bounds 
from [/-statistics to more general class of statistics. Consider a nonlinear 
statistic W = W(X\, . . . , X n ), based on a sequence of the r.v.'s X\, . . . , X n . 
Examples include symmetric statistics, [/-statistics with non-i.i.d. observa- 
tions, L-statistics, Studentized statistics, exchangeable statistics, finite pop- 
ulation statistics, and so on. Error bounds for symmetric statistics and non- 
symmetric statistics were considered by [17] and [7]. Alberink [1] studied 
error bounds for [/-statistics with non-i.i.d. observations. 

Suppose that we can linearize W into 

W = L + A, 

where L is a linear statistic and A is an error term. One way to linearize W is 
by the Hoeffding decomposition. We have seen how it works for U -statistics 
in this paper. For symmetric statistics, we refer to [17]. The Hoeffding de- 
compositions for independent observations and nonsymmetric statistics was 
discussed by [13]. For orthogonal decomposition of finite population statis- 
tics, see [6]. Hoeffding decompositions for exchangeable statistics were con- 
sidered by [16]. 

If L is asymptotically normal and A — > in probability, W is asymptoti- 
cally normal. Using truncation and the Chebyshev inequality, we can easily 
get the following error bound: 

\\F W - $|| < \\F h - $|| + (1 +p)[(\\<S>'\\)/pf( 1+p \E\A\ p ) 1 /( 1+ P\ 

See (1.3) of [7]. By taking p = 2, we get 

(5.1) \\F W - $|| < \\F h - $|| + 2(var A) 1 / 3 . 

For symmetric statistics, the next example shows that the exponent 1/3 
in (5.1) is already the best possible statistic. The example is similar to an 
example of [7]. 

Example 1. Take X,Xi,...,X n to be i.i.d. N(0,1) r.v.'s. Define 

(5.2) h = n- 1/2 (X 1 + ---+X n ), A = -e(|L|- a -E|L|- a ), 
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where < a < 1/2 and e > 0. Then W = L + A is a symmetric statistic with 
finite variance since EA 2 < 2e 2 E|L|~ 2a < oo. Now if we have 

(5.3) ||-Fw-$|| <c||F L -*|| + c(varA)' ? 
or 

(5.4) \\F W - $|| < cE|X| 3 /\/™ + c(var Af, 
then the exponent $ must satisfy i? < 1/3 . 

Proof. Clearly, L~JV(0,1). So ||F L - $|| = sup z |P{L < x} = 0. 

Thus it suffices to show that the weaker inequality (5.4) implies i? < 1/3. 
Also note that var A = c\£ 2 where c\ = var(|L|~ a ) is some positive constant 
depending only on a. It follows that — <&|| and var A in this example 
do not depend on n. Letting n — ► oo in (5.4) yields 

(5.5) ||-Pw-*|| < c(var Af = ccfe™ . 
On the other hand, for sufficiently small e > 0, we have 

||F W -$|| = sup|P{W<:c}-$(z)| 

X 

> P{W < dE|L|~ a } - P{L < eE|L|~ a } 

(5.6) =P{L|L| a <e}-P{L<eE|L|- a } 

= P{0 < L a+1 < e} - P{0 < L < eE|L|~ a } 

> o^lW - c 3 e, 

where c 2 and C3 are positive constants depending only on o. The inequalities 
(5.5) and (5.6) imply ccfe™ > c 2 e 1/(a+1) - c 3 e, that is, 

(5.7) ccie^- 1 /^>c 2 -c 3 e a ^ a+1 \ 

If 2-d - l/(o + 1) > 0, then letting e [ in (5.7) would imply that > c 2 > 0, 
a contradiction. Hence we must have 2i? — l/(a + 1) < for all < a < 1/2. 
Letting a 1 1/2, we have 2tf - 2/3 < 0, i.e., ^ < 1/3. □ 

6. Proofs of Theorems 1-6. We first prove several useful lemmas. 

Lemma 1. We have: 

(a) l<y^/3; 

(b) nE|Li|« < f3 q ~ 2 for2<q<3; 

(c) \k s \ < ^fs< ^/l for s = 1, ... ,k; 

(d) \k s \ < p Ss + 7 i a) < (3 Ss + w/iere 3/2 < a < 2, 5 = (2 - a)/(a - 1), 
and s is an integer such that 1 < s < A A;. 
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PROOF. We will use Holder's inequality: E\XY\ < \(E\X\ a ) 1 / a (E\Y\ b ) 1 / b 
with a > 1 and 1/a + 1/6 = 1. Recall that EL 2 = nELj = 1. 

(a) 1 = (rm- 1 ) 3 / 2 = (nEL 2 ) 3 / 2 = n 3 / 2 (EL 2 ) 3 / 2 < 7^ 3 / 2 K|Z, 1 | 3 = ^K(3. 

(b) nE|Li|« = nElLip^lLiI 3 ^" 2 ) < n(EL 2 ) 3 -<?(E|Li| 3 )<'- 2 = . 

(c) \k s \ < (Z)E\L 1 --.L,T 1 ... a \ < C)(EL 2 ...EL 2 ET 2 ..JV2 = { q < 

l/ 2 ^ 1/2 

(d) We use Holder's inequality with exponents a = a and 6 = a/(a — 1) and 
another basic inequality x 1 / a y 1 / b < x + y for x,y>0. Also note that 
nE|Li| g < P q ~ 2 from (b), and g - 2 = 5. We have 

\ Ka \< Of)E\L 1 --.L.T 1 ....\< ^) (E|^i|')* / *(E|Ti„. 8 | a ) 1 /« 
< Q (E|Li| 9 ) s + ^^E|Ti... s | Q <(nE|Li| g ) s + 7^ 

< / g(9- 2 )* + 7 ( a )= J g*» + 7 ( a ). □ 

Lemma 2. For sufficiently smooth functions g:M— >M, one has 

(6.1) n^ + ^-^Bdbii + ib'iDH 1 / 2 . 

Proof. The result follows from multiplying the obvious inequalities 
\\g{x + h) - g(x)\\ <2\\g\\ and \\g(x + h) - g(x)\\ < \\g'\\\h\, 
and taking a square root, and then applying y/2ab < a + b. □ 

6.1. Proof of Theorem 1. Denote A = f \\F§ — N^\\. We consider two sep- 
arate cases: 

Case I: max{/3, 7l , . . . ,7 fe } > c fc ; 

Case II: max{/3, 7 i, . . . , 7 fc} < c&, where > is a constant depending 
only on k to be determined later. 

Case I is relatively easy to prove. For case II, we take a classical approach. 
That is, using the well-known Esseen's smoothing inequality, we reduce the 
problem to the estimation of some characteristic functions. 

Proof for case I. We shall prove that A <C (3 + 7, which clearly im- 
plies the desired A <C j3 + 7 In 2 (7). From Lemma 1, we have k 2 < 7 p . Using 

this inequality, estimating P{8 < x} < 1 and |$^(a;)| <C 1, we derive 

(6.2) A < 1 + \ Kl \ + • • • + |« fc | < 1 + y/r[+ • • • + < 1 + yfy. 

We consider the alternative cases 7 > 1 and 7 < 1 separately. If 7 > 1 , then 
using (6.2) we have A <C < 7 < (3 + 7. If 7 < 1, then (6.2) implies Ac 1. 
Using the condition, max{/3, 71, ... , 7^} > c&, we have 

A < 1 < cl x max{/3, 7l , . . . , 7fc } < 1 (/? + 71 + • • • + 7fc ) < (3 + 7. 
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Proof for case II. We can assume that n is sufficiently large so that 

n > l/c fc . 

Indeed, for n < l/c k the bound of Theorem 1 holds trivially. To see this, we 
note that, by (c) of Lemma 1, we have k 2 < j p . Since max{/3,7i, . . . ,7^} < c k 
by assumption, we then have At 2 < "y p < c k < n^ 1 < 1. Hence, it follows that 

|iV fc (x)| <l + |/ci| + --- + |K fc | <1 + A;<1. 

Therefore A <C 1. Using 1 < \/n/? (see Lemma 1), for small n < 1/ck, we 
have 1 <C 0. Inequalities A< 1 and 1 <C /3 imply A <C /3 <C /3 + 7 which is 
somewhat better than the bound of Theorem 1. 

By Lemma 1, ft 2 < 7 P which combined with the condition, max{/3, 71, ... , 
7fc} < Cfc, implies that k 2 <C 1 for 1 <p < k. Thus the function N k (x) has a 
bounded derivative, 

(6.3) \N' k (x)\«l + \K 1 \ + --- + \K k \«l. 

Due to (6.3), to estimate A we can apply Esseen's smoothing inequality (see, 
e.g., [8], Chapter XVI, Lemma 3.2). For any a > we have 

(6.4) A«i+ f ItrVW-sWId*, 

a 7- a 

where 

(6.5) /(i) = Eexp{itS}, g(t) = [ exp{itx} dN k (x). 

Note that = (1 + Ki(it) 3 H h K k (it) k+1 ) exp{-t 2 /2}. We choose 



Since max{/3,7i, . . . ,7^} < c k , we have (3 <c k and 7 < fccfc, resulting in 
a > s/ck~/(c k H h c fe ) = 1/((1 + fe)Vcfc) > 1, 

if Cfc is chosen small enough, for example, c k < (I + k)~ 2 . 

Split the integral in (6.4) as /" Q = i|j|<c* + /<7<|t|<a' wnere C = C k is a 
sufficiently large positive constant depending only on k to be chosen later. 

To estimate S\t\<c we use Corollary 1 of Section 4, 

(6.6) \ f( t)- g (t)\^(\t\ + \t\ k+i ){f3 + 1 ). 

It follows that Jw <( 2 ^ + 7i a bound which is somewhat better than the 
desired bound with 7 In 2 (3 + I/7) in place of 7. 

It remains to consider the integral /c<| t i< a - Introduce the characteristic 
functions 1? = #(t) and g = g(t), 

(6.7) ■& = Eexp{itL 1 }, g = exp{-t 2 /(2?i)}. 
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By Lemma 3 below, we have, for 4k <m < n/4, 

(6.8) |/(t)-5WI«l*| fc+4 (l^r + £' m )(/3 + 7) + 7i 2 W« far |t| > 1. 
We shall use the well-known simple bound 

(6.9) |0| < exp{-i 2 /(4ra)} for \t\ < 1/0. 

In particular, the inequality (6.9) holds for C <\t\ < a. 
We choose the integer number 4k < m < n as 

m= [4(k + 5)nt~ 2 \n\t\], C < \t\ < a, 

where [x] is the integer part of x G R. The number m = m(t) depends on 
t. If C = Cfc is sufficiently large, then m is a well defined integer such that 
4k < m < n/4, for sufficiently large n. Now (6.8) and (6.9) imply 

(6.10) |/(t)- 5 (t)K|t|- 1 (/3 + 7) + 7 ln|t|, C<\t\<a. 

Integrating (6.10) over C < \t\ < a we derive Jc<\t\<a ^ + 7 + 7ln 2 a. To 
conclude the proof of the theorem, we note that l<a<3 + l/7 due to our 
choices of constants, and therefore 7 + 7 In 2 a <C 7 In 2 (7). 

We now prove the following lemma, which is used in the proof above. The 
lemma gives an expansion of the characteristic functions for \t\ > 1. 

Lemma 3. Assume that (3 < 1 and 7 < 1. Then for n>4k and 4k < m < 
n/4 we have 

\f(t) - g(t)\ « \t\ k+4 (\V\ m + Q m )(P + 7) + Jt 2 m/n for \t\ > 1. 

Proof. The proof follows from Propositions 1-3 below. When applying 
Propositions 1 and 2 one has to replace m by 2m. □ 

Let us introduce some notation first. Let Q m = {1, . . . , m} and let A be a 
subset of Q n = {1, . . . ,n}, and use |^4| to denote the number of elements in 
A. For convenience, we write Tj^,,.^ or Ti 1 ...{ k instead of Tu lt ... t i k \. Then we 
can write T = Ti + h where, for 1 < p < k, 

t p= Y. t a with T A = tp(Xj ,jeA). 

\A\=p 

We now split T into two parts T^ m ), T^ ) so that 

T = T (m) + T (0) , T (m) = T^ m) H h T[, m) , T (0) = T^ 0) H h Tj^ , 

where 

T (m) = ^ Ta and T (0) = T A . 
\A\=s,Anfl m ^0 \A\=s,AnCl m =0 
We are now ready to prove Propositions 1-3. 
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Proposition 1. Let 1 < m < n. The characteristic function 
f = f{t) = Eexp{ii§} = Eexp{it(L + + T< ro >)} 
satisfies f = fo + f\ + R with 

(6.11) / = Eexp{it(L + T(°))}, h = (ii)E(exp{ii(L + j(°))}tW) 
and a remainder term R such that \R\ <^jt 2 m/n. 

Proof. We use r, To, t\,T2, ■ ■ ■ to denote i.i.d. r.v.'s which are uniformly 
distributed on [0,1], and further assume that they are independent of all 
other r.v.'s. Given a smooth function g, we will frequently use the Taylor 
expansion of the following form 

k 

g (x + h)=Y, 9 {s) (x)h s /s\ + E(l - r) V fe+1 > + rh)h k+1 /k\. 

For instance, we can expand / in powers of T*" 1 ) , and obtain f = fo + fi + R 
with 

R = (it) 2 E(l - t) exp{it(L + + rT( m ))}(T( m )) 2 . 

Thus 

\R\ < t 2 E(T^) 2 « t 2 £ mnP-^T 2 .^ = t 2 - ]T n^T 2 .. p « -t 2 7 . 

p=i p=i 1 

PROPOSITION 2. Let 2k < m < n. The function f + f\ with f and fi 
defined by (6.11) satisfies fo + fi = fi + fs + Ri with f2 = EexpjiiL}, = 
E(ii) exp{iiL}T, and the remainder term R\ satisfies \R\\ <C 7i 2 | , #| m / 2 . 

Proof. Since T = + T( m ) , it suffices to check that 

(6.12) / ~/ 2 + (^)Eexp{itL}T (0) , f x ~ (^)Eexp{itL}T (m) 

with remainder terms bounded as R\. 

Let us prove the first relation in (6.12). Note that 

fo = i9 m Eexp{it(L(°) + T^)} where = L m+1 + ■ ■ ■ + L n 

is the part of L independent of X\ , . . . , X m . Now we can expand in powers 
of T( ). We estimate the remainder term similar to the proof of Propo- 
sition 1. To estimate the variance of T^ ^ we use the obvious inequality 
varl^ ) < var T. 

Let us prove the second relation in (6.12). We consider only the case where 
m is an even integer. It suffices to check that 

(6.13) Eexp{it(L + T (0) )}T(, m) ~ Eexp{itL}T£ m) for p = 1, . . . , k. 
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Let us show that 

(6.14) Eexp{it(L + T(°))}T{, m ) =Eexp{it(L + T(°))}T; 

with T* = Y? s =l S(s) a sTA and a s = (™) / ( m / 2 ) , where J2(s) ^ s taken over all 
subsets A C £l n \ ^ m /2 such that \Af] (f2 m \ £l m /2)\ = s. 

To prove (6.14), let us start with a representation of Eexp{ii(L + T^°))}Tp m ^ . 
Sorting subsets A according to the cardinality of the intersection Ad {1, ... , m} 
and using the symmetry and i.i.d. assumptions, we get 

Eexp{^(L + TW )}¥]>) 

(6.15) 



53 ET^exp^'iQL + T^)} 

A: |A|=p,AnC m ^0 



= £ E ET A exp{it(L + T(°))} 

s=U: |A|=p,|AnQ m |=s 

= E(T) £ ET AsUB exp{it(L + T(°))}, 

s=l ^ ' B: B|=p— s,|Bnf2 m |=0 

where Ai, . . . ,A S C {1, . . . , m} are arbitrary fixed subsets of cardinality s. 
For example, A s = {1, . . . , s}. A consideration similar to (6.15), starting 

with Eexp{it(L + T(°))}T; instead of Eexp{it(L + T( ))}T p m) , shows that 
Eexp{rt(L + T^)}T* is equal to the right-hand side of (6.15), which proves 
the identity (6.14). 

The statistic T* is independent of X±, . . . ,X m / 2 - Therefore, (6.14) implies 

(6.16) Eexp^L + T^jTjf) = tf m / 2 Eexp{ii(L( m / 2) +T^)}T*. 

Now we can expand in powers of T(°). This leads to 

Eexp{it(L + T^)}T^ ~ # m / 2 Eexp{ifl>/ 2) }T; = Eexp{itL}T( m) 

up to an error bounded by t 2 \$\ m / 2 E\T^T*\ [this can be checked as we did 
for (6.15)]. Using the Holder inequality, we have 

E | T (0) T *| < (varTpVarT*) 1 / 2 <varT = 7, 
since varT* <C varT p < varT. Combining the bounds, we get (6.13). □ 

Proposition 3. Assume that \t\ > 1. Let j3 < 1 and 7 < 1. Let n>Ak 
and 1 < m < ra/4. Then the function = EexpjiiL} + (ii)Eexp{iiL}T sat- 
isfies = g + R2 with g(t) = (1 + Ki(it) 3 + • • • + Hk-i(it) k+1 )g n and the 
remainder term R2 such that |i?2| <C (/3 + j)\t\ k+A (Q m + |#| m ). 
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Proof. Using the i.i.d. and symmetry assumptions, we can write = 
$ n + it E{ =1 & n ~ p QD p , where 

Dp = E,T\... p exp{itLi} ■ ■ ■ exp{itL p } 

= (it) p ELi ■ ■ ■ L p Ti... p exp{it(TiLi H h t p L p )}. 

In view of the inequalities | expjitu} — 1| <C \/\v\ and yju\ + • • • + u p <C 
y/ui + • • • + y/Up~ for u s > 0, and \f\t\ < \t\ for \t\ > 1, and the assumptions 
nE|Li| 3 = /3 and EL 2 = 1/n, we have 




= P (^itrv2 E | Ll |3/ 2|L2 ... LpTi ^ 

«|t| fe+1 v ^n^/ 2 (^(ET 1 2 .. p ) 1 /2 

«|i| fc+1 v / ^v / 7 
< N fc+1 (/? + 7). 

This in turn leads to U = fs + #3 with f 5 = tf n + £p =1 (^) p+1 K P ?? n - p and 
the remainder term R 3 such that |i? 3 | < |i| fc+2 |#| m (/3 + 7). 

Recall 1 < y/n/3. By a simple Taylor expansion, we have /3 < 1, and — 
1| -C t 2 EL 2 = t 2 /n < t 2 (3 2 < t 2 p. Hence \^ n ~P - iT| < i 2 /3|tf| m . Therefore, 
using k p < y^y < 1 , we can write fs = f& + R4 with 

and a remainder term R4 such that |i?4| <C |t| fc+3 |i?| m (/3+7). A very standard 
calculation shows that 

(6.17) \$ n - g n \<z:\t\ 3 (\$\ m + Q m )f3. 

Thus using again k p < y^y < 1, it follows that f$ = g{t) + R§ with \R§\ <C 
|i| fc+4 (|#| m + £ m )/3. Collecting all the inequalities, we complete the proof. 

□ 
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6.2. Proof of Theorem 3. The proof is similar to that of Theorem 1. 
Recall that 7^ = (™)E|Ti... s | a . Without loss of generality we assume that 

(6.18) (3<c k and 7^ < c k for s = 1, . . . , jfc; 
and 

(6.19) n>l/c k , 

where c k is a sufficiently small positive constant. A reduction leading to 
(6.18) and (6.19) is based on applications of |k s | <f3 5s +y { s a) < (3 Ss + 
(see Lemma 1) instead of k 2 < 7 P used in the proof of Theorem 1. Using 
(6.18), (6.19) and (d) of Lemma 1, it is easy to check that the function N p 
has a bounded derivative. Thus we can apply Esseen's smoothing inequality. 
For any a > 0, 

(6.20) ||F§-Ay|«i + f \t\- l \f(t)-g^(t)\dt 

a J-a 

with / defined by (6.5) and 

(6.21) 9 {a) {t)= f exp{itx}dN p {x). 

We choose a = v / q:/(/3 + 7 (q) ). Split the integral in (6.20) as J" a = L <c + 
fc<\t\<a' wri ere C = C k is a sufficiently large positive constant depending 
only on k and a to be chosen later. 

To estimate J^ <c we use Corollary 2 of Section 4. Estimating \t\ < C, 
the corollary implies 

(6.22) \f(t)-g<- a) (t)\<\t\ a (J3W a) )- 

It follows that fu^c ^ P + 7^- Note that the presence of the factor \t\ a 
guaranties the convergence of the integral in a neighborhood of t = 0. 
It remains to consider the integral fc<\t\<a- By Lemma 4 below, we have 

\f(t) - (t)\ < |t| fc+4 (|^r + Q m )(P + 7 (a) ) + 7 (Q) \t\ a m/n 

for |t| > 1 and 4k < m < n/4. We integrate this bound as in the proof of The- 
orem 1. A small difference arises since now instead of y\t\ 2 m/n we have the 
summand 7^) \t\ a m/n. The choice of m ~ nt _2 ln|i| leads to y^\t\ a m/n ~ 
7^ a ) |i| a_2 In \t\ which is an integrable function with respect to the measure 
dt/\t\ at |t| = 00 since we assume that a < 2. As a consequence, we now no 
longer have any log factors. 

We now prove the following lemma, which is used in the proof above. This 
lemma extends Lemma 3 to the case of lower moment assumption. 
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Lemma 4. Let 1 < a < 2. Assume (3 < 1 and 7 < 1. T/ien for n> 4k and 
4k <m< n/4, we have 

\f(t)-g^(t)\ « \t\ k + 4 (\$\ m + e m )([3 + 1 ^) +1 ^\t\ a m/n for \t\ > 1, 
where g, f and g( a ' are given by (6.7), (6.5) and (6.21), respectively. 

Proof. We consider the cases a = 1 and 1 < a < 2 separately. First 
consider the case a = 1. Using exp{itx} = 1 + 0(\tx\ a ), the expansion of 
Proposition 1 is replaced by f = fo + R with a remainder R such that 
\R\ <C 7^ \t\ a m/n. The function fo can be represented as 

fo = Eexp{it(L + T^)} = tf m Eexp{rf(L(°) + T^)}. 

Thus by an expansion in powers of T^ ) we can replace it by EexpjiiL} = i9 n . 
We conclude the proof replacing -d n by g n = g = g( Q ) [see (6.17)]. 

Next consider the case 1 < a < 2. Using expjitx} = 1 + itx + 0(|tx| a ) and 
repeating arguments used in the proof of Proposition 1, we have / = /o + /1 + 
R with a remainder term such that \R\ <C ■j^\t\ a m/n. The functions fo, f\ 
are the same as in Proposition 1. A repetition of the proof of Proposition 2 
leads to fo + fi = h + /3 + #1 with |2?i| « t 2 ^\$\ m / 2 . A small difference 
in the proof is that now we have to use Taylor expansions and Holder's 
inequalities adjusted to our lower moment assumption. 

To conclude the proof we have to show that fi + f% = g^ + R2 with a 
remainder term R 2 such that \R 2 \ <C (0 + ^)t k+4 {g m + |t?| m ). The proof 
is similar to that of Proposition 3 provided that some adjustments related 
to the lower moment assumption are made. The flavor of the adjustments 
is like that used in the proof of Theorem 6. We omit formal exposition of 
indeed lengthy and technical details. □ 

6.3. Proof of Theorem 2. We prove the first half now. If 7 > 1, then we 
estimate \\F§ — <3>|| < 2, and it follows that \\F$ — &\\ <C (3 + 7. In the case of 
7 < 1, we use the bound of Theorem 1. Since \G S \ <C \k s \ and k 2 < 7 from 
Lemma 1, we have 

k 

\\Fs-$\\ < \\F s -N k \\+Y / \G s \<.(3 + lln 2 ^) + ^<.p + ^. 

s=l 

Next, we will show the second half of the theorem with an example. Let 
X, Xi, . . . , X n be i.i.d. iV(0, 1) r.v.'s, and define S = L + T, where 

1 n 2e 
IL = — ■=. 7 Xi , T = : — ■= / Xi Xi 

v i=l v ' v l<«<J<n 

with e > 0. Now we can show that if we have 
(6.23) ||F§-$|| </3 + 7 < ? , 
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then we must have $ < 1/2. 

We prove the above claim by contradiction. Namely, assume that the 
contrary holds, that is, that (6.23) holds with some $ > 1/2. Clearly, (3 = 
E|Xi| 3 /v^ « ™~ 1/2 , 7 = varT = C^n" 1 ), k x =0, k 2 = en,- 1 / 2 , Gi(x) = 0, 
and Gi(x) = en _1//2 $'"(x). In view of Theorem 1 and the assumption (6.23), 
we have 

|G 2 (x)| = ||G 2 || < \\F S - $|| + \\F S - $ - G 2 || < f3 + ^ + 7 ln 2 (7)- 
Multiplying y/n on both sides, we get 

\e&"(x)\ < 1 + + V^7ln 2 (7)- 

Letting n — > oo, and in view of 7 = 0{n~ l ) and d > 1/2, we get e|$"(a:)| -C 1 
which contradicts the assumption that e > is an arbitrary positive number. 
Thus we must have < 1/2. 

6.4. Proof of Theorem 5. Let rj be a r.v. from iV(0, 1). Note that the 
left-hand side of the inequality in Theorem 5 can be written as 

E/(S)= I f(u)dF{E<u}, 
Jr 

Ef(r,) + K 1 Ef"(r,) + ---+K k Ef ( ~ k + 1 \i 1 ) = [ f(u)dN k (u). 

Jr 

Expanding /(§) = /(L + T) in powers of T, we get 

E/(S) = E/(L) + E/'(L)T + ■&, d = E(l - r)/"(L + rT)T 2 , 

where r is uniformly distributed on [0,1], and is independent of all other 
r.v.'s. To prove the theorem it suffices to check that 

(6.24) ||E/(L)-E/(r ? )||«||r||/3, 

(6.25) ||ET/'(L) - 1|| <C(/3 + 7 ), 
(6-26) ||*|| < H/I7, 
where 

(6.27) / = Kl Ef"( V ) + ■■■ + Kfc E/( fc+1 ) ( V ) d = h + --- + I k . 

The estimate (6.24) is well known since L = L\ -\ h L n is a sum of i.i.d. 

r.v.'s (see, e.g., [5]). 

The bound (6.26) is obvious. Indeed, we have ||#|| < ||/"||ET 2 = ||/"|| 7 . 

It remains to prove (6.25). Using T = Ti + • • • + T/%, the linear structure 
of T s , and the i.i.d. assumption, we can write 

(6.28) ET/'(L) = Ji + -.- + J fc) J s = ( n )ET 1 ... s f '{!.). 
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In view of (6.27) and (6.28), the proof of (6.25) reduces to checking that 

(6.29) \\I S - J a ||«C(/0 + 7). 

Let us split L = W + R, where W = L\ + V L s and R = L s+ i H h L n . 

Writing ET S = « a E/( a+1 )(12), instead of (6.29), it suffices to prove that 

(6.30) ||/ s -K s ||«C(/3 + 7 ) 
and 

(6.31) \\K a - J s ||<C(/3 + 7 ). 

Proof of (6.30). Note that I s - K s = K s (Ef^ s+1 \rj) - Ef^ s+1 \R)). 
Since L = R + W, to prove (6.30) it suffices to show that 

(6.32) |k s |||E/( s+1 )( ?? )-E/( s+1 )(L)||«C(/? + 7 ) 
and 

(6.33) \k,\ \\Ef( s+1 \R + W)- Ef^ s+1 \R)\\ « C(J3 + 7). 

Let us prove (6.32). We consider the cases (3 > 1 and /3 < 1 separately. 
Using < 7 (see Lemma 1), in the case (5 > 1 we have 

In the case (3 < 1 we use the bound (6.24) replacing / by f( s+1 \ Since k, 2 < 7, 
we get 

\K s \\\Ef^ +1 \r,) - E/( S+1 )(L)|| « ||/( s+4 )|| ^7/3 < C^B < C(j + (3). 

Let us prove (6.33). We apply (6.1) replacing g by f^ s+1 \ the variable x 
by R, and h by W, respectively. Since n 2 < 7, we have 

I as, I ||E/( s+1 )(ii + W)- E/( s+1 )(i?)|| 

(6.34) 

^ (11^+1)11 + ^(,+2) || )7 l/2 E | W |l/2_ 

Using \W\ 1 ' 2 < |Li|V2 + . . . + iLsl 1 / 2 and EL 2 m = 1/n, we derive E\W\ 1 / 2 <C 
n -1 / 4 < /3 1 / 2 . In view of (6.34), an application of V7/3 < 7 + yields (6.33). 

Proof of (6.31). Using short Taylor expansions, we can represent J s 

as 

(6.35) J s = (^j ELt ■ ■ • /.,7i...,,r- ' 1 (V + R) 

with V = T\L\ + • — \- t s L s . Representation (6.35) can be proven in s steps. 
Let us consider details related only to the first step. The degeneracy property 
of the kernels implies that 

(6.36) ETi... s f'(L 2 + ■ ■ ■ + L n ) = 
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since the conditional expectation E(Ti... s |X2, . . . ,X s+ i) = 0, and L 2 + • • • + 
L n is independent of X\. Using (6.36), expanding in powers of L\ we get 

J s = ( U ) EL\T\... s f" {t\L\ + L 2 + • • • + L n ). 

Proceeding in a similar way with L 2 , . . . , L s in place of L\ , we arrive at 
(6.35). 

Using the definition of k s , and the expression (6.35) for J s , we can write 
K s -Js= C)ELi • ■■L s T 1 ... s (f {s+1) (R) ~ f {s+1 \V + R))-By an application 
of (6.1) with g = / (s+1) , x = R and h = V, it follows that 

(6.37) \\K 8 - J s \\ <c(^\e\L 1 ---L s T 1 ... s \\V\ 1 / 2 . 

Using the inequality (Ul 1 / 2 < l^il 1 / 2 + • • • + 1//2 , the Holder inequality, 
the assumption E|Li| 3 = (3/n and EL 2 = 1/n, we get 

E\L 1 ---L S T 1 ... S \\V\ 1/2 < sE\L 1 \^ 2 \L 2 ---L s T 1 ... s \ 

(6.38) < (E|Li| 3 L^--L 2 ET 1 2 .. s ) 1/2 

= n-^^ETf..,) 1 / 2 . 

Using Elf... s = ET 2 /Q <t/("), relations (6.37) and (6.38) yield \\K S - 
J s \\ <C Cv^v^T) a = C)/ nS - Noting that a < 1 and yf]3~f < /3 + 7, we derive 
||if s — J s \\ <C C(/3 + 7) which concludes the proof of (6.31). 

6.5. Proof of Theorem 6. Let rj be a r.v. from iV(0,l). Note that the 
left-hand side of the inequality in Theorem 6 can be written as 

E/(S)= f f(u)d¥{S<u}, 
m 

E/(n) + «iE/"fa) + • • • + « p E/^+ 1 )(t ? ) 
f(u)dN p (u). 



We consider the cases a = 1 and 1 < a < 2, separately. 
First consider the case a = 1. In this case 

/(<*) =0. Similar to (6.1), one 

can check that 

(6.39) /(S) = /(L)+0 with ||i?||<||/|| 1 - a ||/ / |r|T| a . 

It is easy to show that 

E|T P r« 7 w = (' ? ^E|r 1 ... P r, 

(6.40) 

E|Tr«7 (Q) =7i a) + --- + 7i a) - 
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Then the proof follows from (6.24), (6.39) and (6.40). 

Now consider the case 1 < a < 2. Similar to (6.1), we have 

/(§) = /(L) + /'(L)T + with « (H/'ll + ||/"||)|T| Q . 

From (6.40), we get E|T| Q <7 (a) . Also from (6.24) it follows easily that 
||E/(L) - Ef(r))\\ <C \\f"'\\p. It remains to show that 

(6.41) ||ET/'(L) - II < C(/3 + 7 (Q) ), 
where = KiE/"(r/) + • • • + K p E/(P +1 )(r/). We have 

(6.42) ET/'(L) = J x + • • • + J k , J s d ^ Q ETi... 8 /'(L). 

In view of (6.41) and (6.42), the proof of the theorem reduces to checking 

(6.43) ||/ s - J 8 \\ <C(/3 + 7 (a) ) forl<s<p, 
with I s = K s Ef( s+l \r]), and 

(6.44) || J, || <C(/3 + 7 (Q) ) forp<s<fc, 

where p is an integer satisfying the condition of the theorem. 

Estimation of J s for s > p. We have to prove (6.44). Consider the differ- 
ence operator S m such that S m f(x) = f(x + L m ) — f(x). Arguments similar 
to those used to derive (6.35) allow us to write 

(6.45) J s =( n s S jMT 1 ... s 6 1 ---S s f(R) 

with R = + • • • + L n . Similar to (6.1), we have 

• • • 5 s f'(R)\\ « \\f'\\ + ||/ (s+1) |||Li| e • • • \L S \ £ , < e < 1. 

From this and the Holder inequality with exponents q = ot/(a — 1) and a, 
so that 1/q + 1/a = 1, we have 

I|j s ||«cQe|l 1 | £ ---|l s | £ |Ti... s | 

< C Q (E|Li| £9 • • • |L a | B 9) 1 /9(E|Ti... s | a ) 1 /« 

= (E|L 1 | ef ') s y /9 (7( Q )) 1/a 
(E|Lir 9 ) s + C7^. 

To complete the proof of (6.44) we need to prove (™)(E|Li| e<? ) s <C 13. Choose 
e = (2 + l/s)/q. The condition s > p guaranties that e < 1. Using nE|Li| 2+1 / s < 
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f3 1 / s [see (b) of Lemma 1], we have Q (E|Li| £( *) s < (nE|Li| 2+1 / s ) s < (3, which 
completes the proof for s > p. 

Estimation of J s for 1 < s < p. We have to prove (6.43). There is some- 
thing to prove only if3/2<a<2. Indeed, in the case 1 < a < 3/2, the integer 
p from the condition of the theorem satisfies p < 0, and we have no J s to 
estimate. 

Let us split L = W + R where W = L\ -{ h L s and R = L s+1 H h 

L n . Writing K s = K s Ef( s+r >(R) instead of (6.43) it suffices to prove that 
\\I S — K s \\ and \\K S — J s \\ are bounded from above like ||/ s — J s \\ in (6.43). 

Estimation of \\I S - K s \\. Note that I s -K s = k s E(/( s+1 )(7?) - /( s+1 )(i?)). 
Since L = R + W, it suffices to show that 

(6.46) | Ks |||/( s+1 )(r ? )-/( s+1 )(L)||«C(/3 + 7 (a) ) 
and 

(6.47) \K s \\\Ef( s+1 XR + W) -Ef( s+1 \R)\\ <C(/3 + 7 (a) ). 

Let us prove (6.46). We consider the cases f5 > 1 and /5 < 1 separately. 
Using \k s \ <ti f3 5s + 7*°) and Ss < 1 [see (d) of Lemma 1], in the case of 
(3 > 1, we have 

|k s |||E/( s+1 )(?7) -E/( s+1 )(L)|| <2||/( s+1 )||(/? 5s + 7 ( c1 ))<C(/3 + 7( q) ). 

In the case of (3 < 1, we combine (d) of Lemma 1 with the bound (6.24) 
replacing / by We get 

\k s I ||E/( S+1 ) (rj) - E/( s+1 ) (L) || « || /( s+4 ) || {I3 5s + 7 {q) )/3 < C(/3 + 7 W )- 

Let us now prove (6.47). Again we consider the cases of j3 > 1 and /? < 1 
separately. We omit details in the case of j3 > 1 since they are very similar 
to those in the proof of (6.46). In the case of < 1, we use (d) of Lemma 1 
and apply \\f( s+1 \R + W) - Ef^ s+1 \R)\\ < C\\W\\ to get 

(6.48) \K s \\\Ef( s+l \R + W) -Ef s+1 \R)\\ ^C(p Ss + j^)E\W\. 

Using |W| < + • ■ • + |L 8 | and EL 2 m = 1/n, we derive E|W| « n~ 1 / 2 < f3. 
In view of our assumption /3 < 1, the inequality (6.48) yields (6.47). 
Estimation of \\K S — J s \\. By (6.35) we have 

J s = ( n ] ELx ■ ■ • /.,7-;..,.r- I: (U + fl) 

with U = riLi H hT s L s . This representation, combined with the definition 

k s , allows one to write 

K s - J s = Q ELi ■ • •L S T 1 ... S (/( S+1 )(,R) - /( S+1 )(U + fl)). 
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Again we consider the cases of J3 > 1 and (3 <1 separately. We omit details 
in the case of (5 > 1 since they are very similar to those in the proof of (6.46). 
In the case of < 1 , we apply 

||E/( s+1 )(i?) - f( s+1 \R + V)\\< C\\V\\ £ , < e < 1. 

From the inequality \V\ £ <C |^i| e + ••• + |^s| e > the i.i.d. assumption, the 
Holder inequality with exponents q = a/ {a — 1) and a, and also (") < n s , 
we have 

-J s \\<£c( n ] E|Li • • • L 8 Ti... s ||F| e 

< Cn s sE|Li| 1+£ |L 2 ---L s Ti... s | 
^Cn'QEILil^+^QEILil'J'-^^QEIT!...,!") 1 / 01 

< Cn a E|Li|( 1+e ^(E|Li| ff ) 5-1 + Cn s E\T h .. s \ a 
= Cn^ElLil^^EILiI 9 ) 8 - 1 + C^ a \ 

We choose e so that (1 + e)q = 3. In view of 3/2 < a < 2, the number e satis- 
fies < £ < 1. Since nE|Li| 3 = /?, to conclude the estimation of \\K S — J s \\ it 
suffices to verify the inequality raElLiI 9 < 1. Recalling that (5 < 1 and apply- 
ing Lemma 1, we have nElLiI 9 < /3 9-2 < 1, which completes the estimation 
of \\K S - J s \\. 
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